Fast Description-Oriented Community Detection using Subgroup Discovery

Authors

  • Martin Atzmueller
  • Stephan Doerfel
  • Folke Mitzlaff
Abstract

Communities can intuitively be defined as subsets of nodes of a graph with a dense structure. However, community mining usually takes only structural aspects into account and typically provides no concise, easily interpretable description of the communities found. To tackle this issue, we focus on fast description-oriented community detection using subgroup discovery, cf. [1, 2]. To provide communities that are both structurally valid and interpretable, we utilize the graph structure as well as additional descriptive features of the contained nodes. A descriptive community pattern built upon these features describes and identifies a community given by a set of nodes, and vice versa. Essentially, we mine for patterns in the “description space” that characterize interesting sets of nodes in the “graph/community space”; the interestingness of a community is then evaluated by a selectable quality measure. We aim to identify communities according to standard community quality measures while providing characteristic descriptions of the respective communities at the same time. For an efficient approach, we propose several optimistic estimates of standard community quality functions. Together with the proposed exhaustive branch-and-bound algorithm, these estimates enable fast description-oriented community detection. This is demonstrated in an evaluation using five real-world data sets obtained from three different social media applications.
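
The general idea of pruning a pattern search with an optimistic estimate can be illustrated with a minimal Python sketch. It is not the algorithm or the estimates proposed in the paper: the quality measure (edge density of the induced subgraph), the bound, the function names, and the toy data are all assumptions chosen for illustration. The bound is valid because any specialization of a pattern covers a subset of its nodes, so it can contain at most the same internal edges, while a community of at least `min_size` nodes has at least `min_size * (min_size - 1) / 2` node pairs.

```python
import heapq


def internal_edges(nodes, edges):
    """Count edges whose two endpoints both lie in `nodes`."""
    return sum(1 for u, v in edges if u in nodes and v in nodes)


def density(nodes, edges):
    """Illustrative quality measure: edge density of the induced subgraph."""
    n = len(nodes)
    if n < 2:
        return 0.0
    return internal_edges(nodes, edges) / (n * (n - 1) / 2)


def optimistic_estimate(nodes, edges, min_size):
    """Upper bound on the density of any sub-community with >= min_size nodes."""
    min_pairs = min_size * (min_size - 1) / 2
    return min(1.0, internal_edges(nodes, edges) / min_pairs)


def mine(attrs, edges, k=3, min_size=2):
    """Top-k conjunctive feature patterns by density (hypothetical helper)."""
    if min_size < 2:
        raise ValueError("min_size must be at least 2")
    features = sorted({f for fs in attrs.values() for f in fs})
    top = []  # min-heap of (quality, pattern); top[0] is the weakest kept result

    def expand(pattern, cover, start):
        for i in range(start, len(features)):
            f = features[i]
            new_cover = frozenset(n for n in cover if f in attrs[n])
            if len(new_cover) < min_size:
                continue  # coverage only shrinks under specialization
            new_pattern = pattern + (f,)
            q = density(new_cover, edges)
            if len(top) < k:
                heapq.heappush(top, (q, new_pattern))
            elif q > top[0][0]:
                heapq.heapreplace(top, (q, new_pattern))
            # Branch-and-bound step: descend only if a specialization
            # could still beat the weakest pattern in the current top-k.
            bound = optimistic_estimate(new_cover, edges, min_size)
            if len(top) < k or bound > top[0][0]:
                expand(new_pattern, new_cover, i + 1)

    expand((), frozenset(attrs), 0)
    return sorted(top, key=lambda t: (-t[0], t[1]))


# Toy data (assumed): node -> descriptive features, plus undirected edges.
attrs = {
    "a": {"jazz", "berlin"},
    "b": {"jazz", "berlin"},
    "c": {"jazz"},
    "d": {"rock"},
    "e": {"rock", "berlin"},
}
edges = [("a", "b"), ("b", "c"), ("a", "c"), ("d", "e")]

for quality, pattern in mine(attrs, edges, k=3):
    print(" AND ".join(pattern), round(quality, 3))
```

Each result pairs a description (a conjunction of features) with the quality of the node set it covers, which mirrors the link between the description space and the graph/community space. Because pruning uses a strict comparison, patterns that only tie with the current k-th best may be skipped.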

Similar resources

Description-oriented community detection using exhaustive subgroup discovery

Communities can intuitively be defined as subsets of nodes of a graph with a dense structure in the corresponding subgraph. However, for mining such communities usually only structural aspects are taken into account. Typically, no concise nor easily interpretable community description is provided. For tackling this issue, this paper focuses on description-oriented community detection using subg...

Subgroup and Community Analytics on Attributed Graphs

Subgroup discovery and community detection are two approaches having been studied in different research areas like data mining and social network analysis. In this context, these techniques are especially helpful in order to provide for analytical and explorative data mining approaches. We present an organized picture of recent research in subgroup discovery and community detection specifically...

Descriptive Community Detection

Subgroup discovery and community detection are standard approaches for identifying (cohesive) subgroups. This paper presents an organized picture of recent research in descriptive community (and subgroup) detection. Here, it summarizes approaches for the identification of descriptive patterns targeting both static as well as dynamic (sequential) relations. We specifically focus on attributed gr...

Detection of avian leukosis virus subgroup J in albumen of commercial and native fowl eggs using RT-PCR in Fars province of Iran

The subgroup J of ALV (ALV-J) has emerged as an important pathogen of meat-type chickens since 1989. This virus is responsible for economic losses due to both mortality and depressed performance in chickens. So, the objective of this study is the detection of ALV-J in the albumen of commercial and native fowl eggs using RT-PCR. Three hundred and seventy egg albumens were randomly selected from dif...

Local Patterns: Theory and Practice of Constraint-Based Relational Subgroup Discovery

This paper investigates local patterns in the multi-relational constraint-based data mining framework. Given this framework, it contributes to the theory of local patterns by providing the definition of local patterns, and a set of objective and subjective measures for evaluating the quality of induced patterns. These notions are illustrated on a description task of subgroup discovery, taking a...

Publication date: 2015